Discussion: add update_time column to source state table #13437
LGTM, thanks, cc: @cyliu0
so for each input row, there will be a unique timestamp generated for it, right?
Precisely. The row can remain unchanged, since we rely on Debezium's heartbeat event: if the upstream database hasn't been updated for a long time, the offset in the heartbeat stays the same.
Just leaving a comment for visibility. I think it is okay to use i.e. during PG -> CDC -> RW, after all of the data has been updated to PG and
But one problem is that it seems the user cannot use this trick to monitor a pipeline where the PG still gets updated and CDCed, as there is no notion of
Moreover, if
potential solution:
@@ -117,9 +117,16 @@ impl Source {
sub_fields: vec![],
type_name: "".to_string(),
};
let update_time = Field {
Will this affect the schema of the already created source/table? Is this change backward-compatible?
If not, I suggest we add the update time field in the json.
This PR won't be merged. Yesterday we found that Debezium can capture the event time on the upstream, so we decided to report a (process_time - event_time) metric to Prometheus as an indication of lag. FYI: #13440
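For readers following the thread, the lag metric described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from #13440: the names `source_lag_ms` and `now_ms` are hypothetical, and the event time is assumed to arrive as milliseconds since the Unix epoch.

```rust
use std::time::{SystemTime, UNIX_EPOCH};

/// Hypothetical helper: lag in milliseconds between the upstream event time
/// and the time the source processed the event (process_time - event_time).
/// Saturates at zero so that clock skew between hosts never yields a
/// negative (wrapped) lag.
fn source_lag_ms(process_time_ms: u64, event_time_ms: u64) -> u64 {
    process_time_ms.saturating_sub(event_time_ms)
}

/// Hypothetical helper: current wall-clock time in milliseconds since the
/// Unix epoch, used here as the process time.
fn now_ms() -> u64 {
    SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .expect("system clock is set before the Unix epoch")
        .as_millis() as u64
}
```

A caller would compute `source_lag_ms(now_ms(), event_time_ms)` for each event and export the value as a gauge. Unlike a per-row `update_time` column, this leaves the source state table schema untouched, which sidesteps the backward-compatibility question raised earlier in the review.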
I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.
What's changed and what's your intention?
To improve observability and facilitate the measurement of source freshness
related: https://github.com/risingwavelabs/risingwave-docs/issues/1513
Checklist
./risedev check (or alias, ./risedev c)
Documentation
Release note
If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.